We discuss the real-space moments of temperature anisotropies in the cosmic microwave background
(CMB) due to weak gravitational lensing by intervening large-scale structure. We show that
if the probability distribution function of primordial temperature anisotropies is Gaussian, then it
remains unchanged after gravitational lensing. With finite resolution, however, non-zero higher-order
cumulants are generated both by lensing autocorrelations and by cross-correlations between
the lensing potential and secondary anisotropies in the CMB such as the Sunyaev-Zel'dovich (SZ)
effect. Skewness is produced by these lensing-SZ correlations, while kurtosis receives contributions
from both lensing alone and lensing-SZ correlations. We show that if the projected lensing potential
is Gaussian, all cumulants of higher order than the kurtosis vanish. While recent results raise the
possibility of detection of the skewness in upcoming data, the kurtosis will likely remain undetected.

I. INTRODUCTION 

Weak gravitational lensing deflects the paths of cosmic microwave background (CMB) photons propagating from 
the surface of last scattering. One result of this lensing is the transfer of power from large angular scales associated 
with acoustic-peak structures to small angular scales in the damping tail of the anisotropy power spectrum. This
transfer only results in a few-percent modification of the power associated with the acoustic-peak structure, and the
increase in power along the damping tail is significantly smaller than that generated by secondary anisotropies due
to reionization [?]. To identify the effect of gravitational lensing on CMB data, it is necessary to consider signatures
beyond that in the angular power spectrum of temperature fluctuations. The existence of non-vanishing higher-order
cumulants is one such non-Gaussian signature that lensing can generate.

Since gravitational lensing conserves surface brightness, CMB fluctuations from lensing are at the second order
in temperature fluctuations and result in non-Gaussian behavior through non-linear mode coupling. Though lensing
alone does not lead to a three-point correlation function, the correlation between lensing and other secondary
anisotropies can lead to such a contribution. This three-point correlation has been widely discussed in the literature
in terms of its Fourier-space analogue, the bispectrum [?]. Weak lensing of the primary anisotropies can produce
a four-point correlation due to its non-linear mode-coupling nature [?], as can correlations between lensing and
secondary effects [?]. When probed appropriately through quadratic statistics such as the power spectrum of the
squared-temperature map, the trispectrum due to lensing alone can be used for a model-independent recovery of the
projected mass distribution out to the last scattering surface [?]. Though these statistics have been shown to be
interesting and potentially detectable, measurement of these Fourier-based statistics is challenging and techniques are
still underdeveloped for this purpose.

Here, we discuss real-space moments of the lensed CMB temperature anisotropies. Real-space statistics are easily
measurable from data. The only drawbacks are that they are unlikely to be optimal and only provide limited knowledge
of the full non-Gaussian aspect of the temperature distribution. The first attempts to measure non-Gaussianity in the
COBE data relied on real-space cumulants [10], as will attempts using data from its successor experiments such as
MAP and Planck. This motivates our emphasis here on the real-space cumulants such as the skewness and kurtosis;
we make several remarks on higher-order cumulants as well.

As part of this calculation, we extend a previous discussion of the kurtosis due to lensing in Ref. [?] and also consider
effects related to correlations between lensing and secondary effects such as the Sunyaev-Zel'dovich (SZ; [?]) effect.

Real-space moments can be derived from the one-point probability distribution function (PDF) of temperature 
fluctuations, and can conversely be used to constrain the form of this function. In the case of infinite angular 
resolution, we conclude that lensing does not modify the PDF of temperature anisotropies produced at the last 
scattering surface, which is a reflection of the fact that lensing does not create new power but rather transfers
power from large to small angular scales. The higher-order moments are only generated in a temperature map by 
finite-resolution effects such as beam smoothing introduced either experimentally or artificially by explicit filtering. 

The paper is organized as follows. In § II we introduce formalism concerning the weak-lensing approximation
and define the bispectrum, trispectrum, and corresponding higher-order quantities. The bispectrum and trispectrum
induced in the CMB by lensing and secondary anisotropies are derived in § III, and some remarks are made concerning
higher-order cumulants as well. The nonzero bispectrum and trispectrum yield a skewness and kurtosis respectively
in the one-point distribution function of the CMB, as shown in § IV. We refer the reader to Ref. [?] for additional
details related to the effect of lensing on CMB anisotropies. Though we present a general discussion, we illustrate our
results in § V using the currently favored ΛCDM cosmological model with $\Omega_b = 0.05$, $\Omega_m = 0.35$, $\Omega_\Lambda = 0.65$, $h = 0.65$,
and $\sigma_8 = 0.9$. Results for a model with $\sigma_8 = 1.2$, as suggested by CBI, are also considered.

II. LENSING CONTRIBUTION TO CMB FLUCTUATIONS 

In order to derive the effects of weak lensing on the CMB, we follow Refs. [?] and adopt a flat-sky approximation.
As discussed in prior papers [?], weak lensing remaps temperature through angular deflections along the photon
path:

$$
\begin{aligned}
\tilde{\Theta}(\hat{\mathbf{n}}) &= \Theta(\hat{\mathbf{n}} + \nabla\phi) \\
&= \Theta(\hat{\mathbf{n}}) + \nabla_i\phi(\hat{\mathbf{n}})\,\nabla^i\Theta(\hat{\mathbf{n}})
 + \frac{1}{2}\,\nabla_i\phi(\hat{\mathbf{n}})\,\nabla_j\phi(\hat{\mathbf{n}})\,\nabla^i\nabla^j\Theta(\hat{\mathbf{n}}) + \ldots
\end{aligned}
\tag{1}
$$

Here, $\Theta(\hat{\mathbf{n}})$ is the unlensed primary component of the CMB in direction $\hat{\mathbf{n}}$ at the last scattering surface, $\tilde{\Theta}(\hat{\mathbf{n}})$ is
the lensed map, $\phi(\hat{\mathbf{n}})$ is the projected gravitational potential, and $\nabla\phi$ is the lensing deflection angle. It should be
understood that in the presence of low-redshift contributions to CMB fluctuations resulting from large-scale structure,
the total map includes secondary contributions which we denote by $\Theta^s(\hat{\mathbf{n}})$. Since the weak-lensing deflection angles
$\nabla\phi$ also trace the large-scale structure at low redshifts, secondary effects which are first order in density fluctuations
correlate with the lensing deflection angles. These secondary effects include the integrated Sachs-Wolfe (ISW; [?])
and the SZ effects [?]. In all real cases, a noise component denoted by $\Theta^n(\hat{\mathbf{n}})$ due to finite experimental sensitivity
must be included as well. Thus the total observed CMB anisotropy will be $\Theta^t(\hat{\mathbf{n}}) = \tilde{\Theta}(\hat{\mathbf{n}}) + \Theta^s(\hat{\mathbf{n}}) + \Theta^n(\hat{\mathbf{n}})$. In
the following discussion, secondary anisotropies $\Theta^s(\hat{\mathbf{n}})$ will be neglected until subsection III B, while the effects of
instrumental noise $\Theta^n(\hat{\mathbf{n}})$ on the PDF are discussed in § IV.

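Taking the Fourier transform, as appropriate for a flat sky, we write

$$
\begin{aligned}
\tilde{\Theta}(\mathbf{l}) &= \int d\hat{\mathbf{n}}\, \tilde{\Theta}(\hat{\mathbf{n}})\, e^{-i\mathbf{l}\cdot\hat{\mathbf{n}}} \\
&= \Theta(\mathbf{l}) - \int \frac{d^2l'}{(2\pi)^2}\, \Theta(\mathbf{l}')\, L(\mathbf{l},\mathbf{l}') ,
\end{aligned}
\tag{2}
$$

where

$$
L(\mathbf{l},\mathbf{l}') = \phi(\mathbf{l}-\mathbf{l}')\,[(\mathbf{l}-\mathbf{l}')\cdot\mathbf{l}']
 + \frac{1}{2}\int \frac{d^2l''}{(2\pi)^2}\, \phi(\mathbf{l}'')\,\phi^*(\mathbf{l}''+\mathbf{l}'-\mathbf{l})\,(\mathbf{l}''\cdot\mathbf{l}')\,[(\mathbf{l}''+\mathbf{l}'-\mathbf{l})\cdot\mathbf{l}'] .
\tag{3}
$$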

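We define the power spectrum, bispectrum, trispectrum and the n-point correlator in Fourier space in the usual
way:

$$
\begin{aligned}
\langle \Theta(\mathbf{l}_1)\Theta(\mathbf{l}_2) \rangle &= (2\pi)^2 \delta_D(\mathbf{l}_{12})\, C_l^{\Theta} , \\
\langle \Theta(\mathbf{l}_1) \ldots \Theta(\mathbf{l}_3) \rangle_c &= (2\pi)^2 \delta_D(\mathbf{l}_{123})\, B^{\Theta}(\mathbf{l}_1,\mathbf{l}_2,\mathbf{l}_3) , \\
\langle \Theta(\mathbf{l}_1) \ldots \Theta(\mathbf{l}_4) \rangle_c &= (2\pi)^2 \delta_D(\mathbf{l}_{1234})\, T^{\Theta}(\mathbf{l}_1,\mathbf{l}_2,\mathbf{l}_3,\mathbf{l}_4) , \\
\langle \Theta(\mathbf{l}_1) \ldots \Theta(\mathbf{l}_n) \rangle_c &= (2\pi)^2 \delta_D(\mathbf{l}_{1\ldots n})\, T_n^{\Theta}(\mathbf{l}_1,\ldots,\mathbf{l}_n) ,
\end{aligned}
\tag{4}
$$

where $\mathbf{l}_{1\ldots n} = \mathbf{l}_1 + \ldots + \mathbf{l}_n$, and the subscript $c$ denotes the connected portion of the correlation function. We make the
assumption that primary anisotropies at the last scattering surface are Gaussian, implying that all cumulants higher
than the power spectrum vanish: $\langle \Theta(\mathbf{l}_1) \ldots \Theta(\mathbf{l}_n) \rangle_c = 0$ when $n > 2$.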

The nth cumulant of the temperature anisotropies is defined in the usual manner,

$$
\langle \Theta^n(\theta) \rangle_c = \int \frac{d^2l_1}{(2\pi)^2} \ldots \frac{d^2l_n}{(2\pi)^2}\,
 \langle \Theta^t(\mathbf{l}_1) \ldots \Theta^t(\mathbf{l}_n) \rangle_c\, W(l_1\theta) \ldots W(l_n\theta) ,
\tag{5}
$$

where $\theta$ is the smoothing scale of the map from which the cumulants are determined, and $W(l\theta)$ is the smoothing
window function. We will use Gaussian window functions throughout this paper. In general, the finite resolution of
real CMB anisotropy experiments induces Gaussian smoothing at the angular scale of the experimental beam size.
For infinite resolution, we take $\theta \to 0$ such that $W(l\theta) \to 1$.

III. POWER SPECTRUM, BISPECTRUM AND TRISPECTRUM



Using the formalism introduced in the previous Section, we can calculate the moments of the CMB fluctuations 
generated by lensing assuming Gaussian fluctuations at the surface of last scatter. The power spectrum for the lensed 
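map is

$$
\tilde{C}_l^{\Theta} = \left[ 1 - \int \frac{d^2l_1}{(2\pi)^2}\, C_{l_1}^{\phi\phi}\, (\mathbf{l}_1\cdot\mathbf{l})^2 \right] C_l^{\Theta}
 + \int \frac{d^2l_1}{(2\pi)^2}\, C_{|\mathbf{l}-\mathbf{l}_1|}^{\Theta}\, C_{l_1}^{\phi\phi}\, [(\mathbf{l}-\mathbf{l}_1)\cdot\mathbf{l}_1]^2 .
\tag{6}
$$

The variance, or the second moment of the temperature, can be obtained following Eq. (5):

$$
\sigma^2(\theta) = \int \frac{d^2l}{(2\pi)^2}\, \tilde{C}_l^{\Theta}\, W^2(l\theta) .
\tag{7}
$$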



Substituting Eq. (6) in here, we find that in the case of infinite resolution ($W(l\theta) = 1$), the variance of the lensed
temperature map coincides with that of the unlensed map. Thus, as expected, lensing conserves the total power 
associated with the temperature fluctuations. This is consistent with our basic expectation that lensing only results 
in a transfer of power from large angular scales to small angular scales. With finite resolution at levels considered 
here, the variance of the lensed temperature field differs from that of the unlensed field by a few percent at most. 
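
The cancellation behind this statement can be made explicit. The following is a sketch only: it assumes the standard first-order flat-sky form of the lensed power spectrum (first line below) and sets $W = 1$; the only manipulation is the shift $\mathbf{l} \to \mathbf{l} + \mathbf{l}_1$ in the last term.

```latex
\begin{align*}
\tilde{C}_l^{\Theta} &= \left[ 1 - \int \frac{d^2l_1}{(2\pi)^2}\, C_{l_1}^{\phi\phi} (\mathbf{l}_1\cdot\mathbf{l})^2 \right] C_l^{\Theta}
 + \int \frac{d^2l_1}{(2\pi)^2}\, C_{|\mathbf{l}-\mathbf{l}_1|}^{\Theta} C_{l_1}^{\phi\phi} [(\mathbf{l}-\mathbf{l}_1)\cdot\mathbf{l}_1]^2 , \\
\int \frac{d^2l}{(2\pi)^2}\, \tilde{C}_l^{\Theta}
 &= \int \frac{d^2l}{(2\pi)^2}\, C_l^{\Theta}
 - \int \frac{d^2l}{(2\pi)^2} \frac{d^2l_1}{(2\pi)^2}\, C_l^{\Theta} C_{l_1}^{\phi\phi} (\mathbf{l}_1\cdot\mathbf{l})^2
 + \int \frac{d^2l}{(2\pi)^2} \frac{d^2l_1}{(2\pi)^2}\, C_{|\mathbf{l}-\mathbf{l}_1|}^{\Theta} C_{l_1}^{\phi\phi} [(\mathbf{l}-\mathbf{l}_1)\cdot\mathbf{l}_1]^2 \\
 &= \int \frac{d^2l}{(2\pi)^2}\, C_l^{\Theta}
 \qquad (\text{after } \mathbf{l} \to \mathbf{l} + \mathbf{l}_1 \text{ in the last term, which then cancels the second}) .
\end{align*}
```

With a window function that is not identically unity the two lensing terms are weighted differently and no longer cancel exactly, which is why a small residual change in the variance appears at finite resolution.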

We will now discuss higher-order correlations of temperature due to gravitational lensing. We consider first contributions
due to lensing alone, and then discuss additional contributions created by lensing-secondary correlations.



A. Lensing Correlations 

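We will first discuss the temperature bispectrum and show that it is zero in the absence of secondary anisotropies.
To understand why there is no contribution to the bispectrum, consider the moments involving three temperature
terms in Fourier space:

$$
\begin{aligned}
\langle \tilde{\Theta}(\mathbf{l}_1)\tilde{\Theta}(\mathbf{l}_2)\tilde{\Theta}(\mathbf{l}_3) \rangle_c
&= \Big\langle \Big( \Theta(\mathbf{l}_1) - \int \frac{d^2l_1'}{(2\pi)^2}\, \Theta(\mathbf{l}_1') L(\mathbf{l}_1,\mathbf{l}_1') \Big)
 \Big( \Theta(\mathbf{l}_2) - \int \frac{d^2l_2'}{(2\pi)^2}\, \Theta(\mathbf{l}_2') L(\mathbf{l}_2,\mathbf{l}_2') \Big)
 \Big( \Theta(\mathbf{l}_3) - \int \frac{d^2l_3'}{(2\pi)^2}\, \Theta(\mathbf{l}_3') L(\mathbf{l}_3,\mathbf{l}_3') \Big) \Big\rangle \\
&= \langle \Theta(\mathbf{l}_1)\Theta(\mathbf{l}_2)\Theta(\mathbf{l}_3) \rangle
 - \Big\langle \Theta(\mathbf{l}_1)\Theta(\mathbf{l}_2) \Big( \int \frac{d^2l_3'}{(2\pi)^2}\, \Theta(\mathbf{l}_3') L(\mathbf{l}_3,\mathbf{l}_3') \Big) \Big\rangle + \mathrm{Perm.} \\
&\quad + \Big\langle \Theta(\mathbf{l}_1) \Big( \int \frac{d^2l_2'}{(2\pi)^2}\, \Theta(\mathbf{l}_2') L(\mathbf{l}_2,\mathbf{l}_2') \Big)
 \Big( \int \frac{d^2l_3'}{(2\pi)^2}\, \Theta(\mathbf{l}_3') L(\mathbf{l}_3,\mathbf{l}_3') \Big) \Big\rangle + \mathrm{Perm.} \\
&\quad - \Big\langle \Big( \int \frac{d^2l_1'}{(2\pi)^2}\, \Theta(\mathbf{l}_1') L(\mathbf{l}_1,\mathbf{l}_1') \Big)
 \Big( \int \frac{d^2l_2'}{(2\pi)^2}\, \Theta(\mathbf{l}_2') L(\mathbf{l}_2,\mathbf{l}_2') \Big)
 \Big( \int \frac{d^2l_3'}{(2\pi)^2}\, \Theta(\mathbf{l}_3') L(\mathbf{l}_3,\mathbf{l}_3') \Big) \Big\rangle .
\end{aligned}
\tag{8}
$$

All these terms, and the necessary permutations, involve an expectation value of three primary temperature
anisotropies. Under our assumption of Gaussian primary temperature fluctuations, such expectation values vanish
and thus there is no contribution to the bispectrum or the skewness.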

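The trispectrum due to lensing alone can be calculated in a similar fashion. Introducing the power spectrum of
lensing potentials, following Refs. [?], we obtain the CMB trispectrum due to gravitational lensing as:

$$
T^{\Theta}(\mathbf{l}_1,\mathbf{l}_2,\mathbf{l}_3,\mathbf{l}_4) = -C_{l_3}^{\Theta} C_{l_4}^{\Theta}
 \Big\{ C_{|\mathbf{l}_1+\mathbf{l}_3|}^{\phi\phi}\, [(\mathbf{l}_1+\mathbf{l}_3)\cdot\mathbf{l}_3]\,[(\mathbf{l}_1+\mathbf{l}_3)\cdot\mathbf{l}_4]
 + C_{|\mathbf{l}_2+\mathbf{l}_3|}^{\phi\phi}\, [(\mathbf{l}_2+\mathbf{l}_3)\cdot\mathbf{l}_3]\,[(\mathbf{l}_2+\mathbf{l}_3)\cdot\mathbf{l}_4] \Big\} + \mathrm{Perm.} ,
\tag{9}
$$

where the permutations now contain 5 additional terms with the replacement of $(\mathbf{l}_3, \mathbf{l}_4)$ by any other pair.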

We can generalize our discussion of the power spectrum, bispectrum, and trispectrum to that of the n-point 
correlation function in Fourier space. In the absence of secondary anisotropies that correlate directly with the lensing 
potential, the n-point correlation function will vanish for odd n for the same reason that lensing alone did not generate 
a bispectrum. All such terms would involve the expectation value of an odd number of temperature fluctuations, and 
under the assumption of Gaussian primary anisotropies, such expectation values must vanish. This statement applies 
in particular to the case when measurements of non-Gaussianity are made using CMB maps which have been cleaned a 
priori of secondary fluctuations using information such as the nonthermal frequency dependence of these fluctuations. 
We will discuss the case of secondary anisotropies in the next subsection. 
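
The lowest even nth correlator after the trispectrum is the six-point correlation function in Fourier space. We can
write the portion of the connected part of this correlation function containing the lowest-order contribution in $\phi$ as

$$
\begin{aligned}
\langle \tilde{\Theta}(\mathbf{l}_1) \ldots \tilde{\Theta}(\mathbf{l}_6) \rangle_c
&= \Big\langle \Big( \Theta(\mathbf{l}_1) - \int \frac{d^2l_1'}{(2\pi)^2}\, \Theta(\mathbf{l}_1') L(\mathbf{l}_1,\mathbf{l}_1') \Big) \ldots
 \Big( \Theta(\mathbf{l}_3) - \int \frac{d^2l_3'}{(2\pi)^2}\, \Theta(\mathbf{l}_3') L(\mathbf{l}_3,\mathbf{l}_3') \Big)
 \Theta(\mathbf{l}_4)\Theta(\mathbf{l}_5)\Theta(\mathbf{l}_6) \Big\rangle + \mathrm{Perm.} \\
&= -\Big\langle \int \frac{d^2l_1'}{(2\pi)^2}\, \Theta(\mathbf{l}_1') L(\mathbf{l}_1,\mathbf{l}_1') \ldots
 \int \frac{d^2l_3'}{(2\pi)^2}\, \Theta(\mathbf{l}_3') L(\mathbf{l}_3,\mathbf{l}_3')\,
 \Theta(\mathbf{l}_4)\Theta(\mathbf{l}_5)\Theta(\mathbf{l}_6) \Big\rangle + \mathrm{Perm.}
\end{aligned}
\tag{10}
$$

Simplifying further, we see that the lowest-order contribution in $\phi$ thus involves

$$
\langle \tilde{\Theta}(\mathbf{l}_1) \ldots \tilde{\Theta}(\mathbf{l}_6) \rangle_c = C_{l_4}^{\Theta} C_{l_5}^{\Theta} C_{l_6}^{\Theta}\,
 \langle \phi(\mathbf{l}_1+\mathbf{l}_4)\,\phi(\mathbf{l}_2+\mathbf{l}_5)\,\phi(\mathbf{l}_3+\mathbf{l}_6) \rangle\,
 [(\mathbf{l}_1+\mathbf{l}_4)\cdot\mathbf{l}_4]\,[(\mathbf{l}_2+\mathbf{l}_5)\cdot\mathbf{l}_5]\,[(\mathbf{l}_3+\mathbf{l}_6)\cdot\mathbf{l}_6] + \mathrm{Perm.}
\tag{11}
$$

The connected part of the six-point correlation function in Fourier space is thus proportional to the bispectrum of
lensing potentials. We can write

$$
T_6^{\Theta}(\mathbf{l}_1,\mathbf{l}_2,\mathbf{l}_3,\mathbf{l}_4,\mathbf{l}_5,\mathbf{l}_6) = C_{l_4}^{\Theta} C_{l_5}^{\Theta} C_{l_6}^{\Theta}
 \Big[ B^{\phi}(\mathbf{l}_1+\mathbf{l}_4, \mathbf{l}_2+\mathbf{l}_5, \mathbf{l}_3+\mathbf{l}_6)\,
 [(\mathbf{l}_1+\mathbf{l}_4)\cdot\mathbf{l}_4]\,[(\mathbf{l}_2+\mathbf{l}_5)\cdot\mathbf{l}_5]\,[(\mathbf{l}_3+\mathbf{l}_6)\cdot\mathbf{l}_6] \Big] + \mathrm{Perm.}
\tag{12}
$$

There are in total 120 such terms appearing in the six-point correlator when we include all permutations, coming
from the 20 different triplets $(\mathbf{l}_i, \mathbf{l}_j, \mathbf{l}_k)$ and the 6 permutations of each triplet.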
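
The counting of terms can be checked by brute-force enumeration. The sketch below is illustrative only: the function name `enumerate_terms` and the representation of a term as a tuple of leg pairings are our own choices, not part of the derivation. It picks which n/2 legs carry a primary power spectrum and then pairs them, in every order, with the remaining legs.

```python
from itertools import combinations, permutations


def enumerate_terms(n):
    """List the pairings that label terms of the n-point correlator.

    Each term is a tuple of (partner_leg, spectrum_leg) pairs, e.g.
    ((1, 4), (2, 5), (3, 6)) stands for the term built from
    l1+l4, l2+l5, l3+l6 with primary spectra on legs 4, 5, 6.
    Hypothetical helper for illustration only.
    """
    if n % 2:
        return []  # odd-n correlators vanish for lensing alone
    legs = tuple(range(1, n + 1))
    half = n // 2
    terms = []
    for spectrum_legs in combinations(legs, half):      # n!/[(n/2)!(n/2)!] choices
        partners = tuple(l for l in legs if l not in spectrum_legs)
        for order in permutations(spectrum_legs):        # (n/2)! orderings each
            terms.append(tuple(zip(partners, order)))
    return terms


if __name__ == "__main__":
    for n in (4, 6):
        print(n, len(enumerate_terms(n)))
```

For n = 6 this gives 20 × 6 = 120 labeled terms, and for n = 4 it gives 12, matching the two displayed trispectrum terms together with their five additional permutations.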

We can generalize these derivations to the n-point temperature correlation in Fourier space under gravitational
lensing. In the following, note that contributions to n-point temperature correlations in Fourier space come from
(n/2)-point correlations in the lensing potential. We can thus write the connected part of the n-point temperature
correlator, when $n > 2$, as

$$
T_n^{\Theta}(\mathbf{l}_1, \ldots, \mathbf{l}_n) = C_{l_{(n/2)+1}}^{\Theta} \cdots C_{l_n}^{\Theta}\,
 T_{n/2}^{\phi}(\mathbf{l}_1+\mathbf{l}_{(n/2)+1}, \ldots, \mathbf{l}_{n/2}+\mathbf{l}_n)\,
 [(\mathbf{l}_1+\mathbf{l}_{(n/2)+1})\cdot\mathbf{l}_{(n/2)+1}] \cdots [(\mathbf{l}_{n/2}+\mathbf{l}_n)\cdot\mathbf{l}_n] + \mathrm{Perm.} ,
\tag{13}
$$

where $T_n^{\phi}(\mathbf{l}_1, \ldots, \mathbf{l}_n)$ is the n-point correlator of the lensing potential in Fourier space. The permutations here now
involve $n!/(n/2)!$ terms corresponding to the replacement of $(\mathbf{l}_{(n/2)+1}, \ldots, \mathbf{l}_n)$ with one of the other $n!/[(n/2)!(n/2)!]$
combinations and the $(n/2)!$ permutations of each combination. As we have discussed, note that $T_n^{\Theta}(\mathbf{l}_1, \ldots, \mathbf{l}_n) = 0$
when n is odd.

In the limit that the lensing potentials are Gaussian distributed, $T_n^{\phi}(\mathbf{l}_1, \ldots, \mathbf{l}_n) = 0$ when $n > 2$. Thus, lensing of
CMB anisotropies can only generate a trispectrum and, with smoothing, a kurtosis. The non-Gaussianity associated
with the large-scale structure, however, will induce non-Gaussian contributions to the distribution of projected potentials
such that $T_n^{\phi}(\mathbf{l}_1, \ldots, \mathbf{l}_n) \neq 0$ for some n. Since large-scale structure most efficiently lenses the CMB at redshifts
close to 3, where the non-Gaussianity is mild, we ignore the higher-order correlations of lensing potentials and only
consider the dominant power spectrum, $C_l^{\phi\phi}$, which contributes to the trispectrum only.

Although theoretical predictions are made in terms of ensemble-averaged correlation functions, observationally we 
have access to only one realization of the CMB and one realization of the large-scale structure. The arbitrariness of the 
observed realization of the large-scale structure induces additional cosmic variance beyond that normally associated 
with the surface of last-scatter. One consequence is that when measured on a small patch of the sky, the observed 
two-point correlation function of the lensed map is more anisotropic than that of the unlensed map, though isotropy 
holds when a sufficiently large region of the sky is considered. The excess anisotropy is induced by cosmic shear, 
and allows us to reconstruct the lensing deflection angle from quadratic maps involving the CMB temperature and 
polarization [?]. While we emphasize one-point statistics in this paper, a more detailed account of how higher-order
statistics probe the local anisotropy induced by lensing may prove fruitful in the future. 



B. Lensing-Secondary Correlations 

The above discussion applies to the case where other secondary fluctuations do not contribute to temperature
anisotropies. In practice, such a situation can be achieved when thermal CMB fluctuations are separated from
dominant secondary effects like the SZ contribution. In experiments where this is not possible, say due to a lack
of multifrequency data, additional non-Gaussianities will be present in the CMB map due to correlations between
lensing potentials and the secondary anisotropies. The most significant of these contributions is to the three-point
correlation function. We can calculate this by replacing the $\tilde{\Theta}(\mathbf{l})$ terms in Eq. (8) with $\Theta^t(\mathbf{l})$. By assumption, Gaussian
instrumental noise cannot generate a bispectrum, and as shown above neither does lensing alone. The total observed
bispectrum is therefore that due to lensing-secondary correlations [?],

$$
B^{t}(\mathbf{l}_1,\mathbf{l}_2,\mathbf{l}_3) = -C_{l_1}^{\phi s} \left[ C_{l_2}^{\Theta} (\mathbf{l}_2\cdot\mathbf{l}_1) + C_{l_3}^{\Theta} (\mathbf{l}_3\cdot\mathbf{l}_1) \right] + \mathrm{Perm.} ,
\tag{14}
$$

where permutations involve two additional terms with the replacement of $\mathbf{l}_1$ with $\mathbf{l}_2$ and $\mathbf{l}_3$. Here, $C_l^{\phi s}$ is the power
spectrum describing correlations between secondary anisotropies and the lensing potential generated by large-scale
structure. These correlations were discussed in detail in Ref. [?], where it was found that the most significant correlation
is the one between lensing potentials and the SZ effect. We will use this correlation in illustrating our results.

The presence of secondary effects also modifies the trispectrum and generates an additional contribution beyond
the one discussed in Eq. (9). Following Ref. [?], we can write this contribution as

$$
\begin{aligned}
T^{\Theta s}(\mathbf{l}_1,\mathbf{l}_2,\mathbf{l}_3,\mathbf{l}_4) = C_{l_3}^{\phi s} C_{l_4}^{\phi s} \Big\{
 & C_{l_1}^{\Theta} (\mathbf{l}_3\cdot\mathbf{l}_1)(\mathbf{l}_4\cdot\mathbf{l}_1) + C_{l_2}^{\Theta} (\mathbf{l}_3\cdot\mathbf{l}_2)(\mathbf{l}_4\cdot\mathbf{l}_2) \\
 & + [\mathbf{l}_3\cdot(\mathbf{l}_1+\mathbf{l}_3)]\,[\mathbf{l}_4\cdot(\mathbf{l}_2+\mathbf{l}_4)]\, C_{|\mathbf{l}_1+\mathbf{l}_3|}^{\Theta}
 + [\mathbf{l}_4\cdot(\mathbf{l}_1+\mathbf{l}_4)]\,[\mathbf{l}_3\cdot(\mathbf{l}_2+\mathbf{l}_3)]\, C_{|\mathbf{l}_1+\mathbf{l}_4|}^{\Theta} \Big\} + \mathrm{Perm.} ,
\end{aligned}
\tag{15}
$$

where permutations involve five additional terms involving the pairings of $(\mathbf{l}_3, \mathbf{l}_4)$.

Due to an increase in terms as one goes to higher order, we failed to obtain a general expression for the n-point
correlator of temperature fluctuations in Fourier space due to lensing-secondary correlations. As we will soon discuss,
cumulants beyond the skewness are unlikely to be important as we find kurtosis to be undetectable even for a perfect
experiment with no noise and all-sky observations. We expect this to hold true even when considering higher-order
moments beyond the kurtosis.



IV. SKEWNESS AND KURTOSIS 



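A simple way to identify the non-Gaussianity induced in the CMB by gravitational lensing is to measure the higher-order
cumulants of its one-point probability distribution function $P_{\rm obs}(\Theta^t;\theta)$ smoothed with beamwidth $\theta$. This
observed one-point probability distribution function (PDF) is actually a convolution of the signal PDF $P_{\rm sig}(\Theta^{\rm sig};\theta)$
with the noise PDF $P_{\rm noise}(\Theta^n;\theta)$ as described below, where $\Theta^{\rm sig} = \tilde{\Theta} + \Theta^s$ is the total of both lensed primary and
secondary contributions to the signal. The signal PDF can be expressed in terms of its cumulants, which we now
proceed to calculate. The third and fourth cumulants are proportional to the dimensionless quantities known as the
skewness, $S$, and the kurtosis, $K$, respectively:

$$
\begin{aligned}
S(\theta) &= [\sigma(\theta)]^{-3} \int (\Theta^{\rm sig})^3\, P_{\rm sig}(\Theta^{\rm sig};\theta)\, d\Theta^{\rm sig} , \\
K(\theta) &= [\sigma(\theta)]^{-4} \int (\Theta^{\rm sig})^4\, P_{\rm sig}(\Theta^{\rm sig};\theta)\, d\Theta^{\rm sig} - 3 .
\end{aligned}
\tag{16}
$$

They can be expressed as integrals over the bispectrum and trispectrum derived in the preceding Section according
to Eq. (5):

$$
\begin{aligned}
S(\theta) &= \frac{1}{[\sigma(\theta)]^{3}} \int \frac{d^2l_1}{(2\pi)^2} \frac{d^2l_2}{(2\pi)^2} \frac{d^2l_3}{(2\pi)^2}\,
 (2\pi)^2 \delta_D(\mathbf{l}_{123})\, B^{t}(\mathbf{l}_1,\mathbf{l}_2,\mathbf{l}_3)\, W(l_1\theta) W(l_2\theta) W(l_3\theta) , \\
K(\theta) &= \frac{1}{[\sigma(\theta)]^{4}} \int \frac{d^2l_1}{(2\pi)^2} \cdots \frac{d^2l_4}{(2\pi)^2}\,
 (2\pi)^2 \delta_D(\mathbf{l}_{1234})\, T^{t}(\mathbf{l}_1,\mathbf{l}_2,\mathbf{l}_3,\mathbf{l}_4)\, W(l_1\theta) W(l_2\theta) W(l_3\theta) W(l_4\theta) .
\end{aligned}
\tag{17}
$$

Inserting Eqs. (9), (14), and (15) into the above expressions, and adopting a Gaussian window function $W(l\theta) = e^{-\sigma_b^2 l^2/2}$
with $\sigma_b = \theta/\sqrt{8\ln 2}$, we obtain: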

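$$
\begin{aligned}
S(\theta) &= \frac{6}{(2\pi)^2 [\sigma(\theta)]^3} \int l_1^2\, dl_1\, l_2^2\, dl_2\; C_{l_1}^{\Theta}\, C_{l_2}^{\phi s}\,
 I_1(\sigma_b^2 l_1 l_2)\, e^{-\sigma_b^2 (l_1^2 + l_2^2)} , \\
K^{\phi\phi}(\theta) &= \frac{12}{(2\pi)^3 [\sigma(\theta)]^4} \int dl_1\, l_1^3\, C_{l_1}^{\phi\phi}\, e^{-\sigma_b^2 l_1^2}
 \left[ \int dl_2\, l_2^2\, C_{l_2}^{\Theta}\, I_1(\sigma_b^2 l_1 l_2)\, e^{-\sigma_b^2 l_2^2} \right]^2 .
\end{aligned}
\tag{18}
$$

The third member of Eq. (18), the corresponding integral expression for the lensing-SZ kurtosis $K^{\phi s}(\theta)$, is too
badly damaged in the source to be reproduced here. Here $I_1(x)$ is a modified Bessel function of the first kind.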
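
The Gaussian smoothing that enters these integrals is easy to evaluate numerically. The following is a minimal sketch, assuming that $\theta$ is the beam full width at half maximum in radians and that $W(l\theta) = e^{-\sigma_b^2 l^2/2}$ with $\sigma_b = \theta/\sqrt{8\ln 2}$; the function names and the `ARCMIN` constant are our own and purely illustrative.

```python
import math

ARCMIN = math.pi / (180.0 * 60.0)  # one arcminute in radians


def beam_sigma(fwhm_rad):
    """Gaussian width sigma_b corresponding to a beam FWHM (radians)."""
    return fwhm_rad / math.sqrt(8.0 * math.log(2.0))


def gaussian_window(ell, fwhm_rad):
    """Window W(l theta) = exp(-sigma_b^2 l^2 / 2) at multipole ell."""
    sigma_b = beam_sigma(fwhm_rad)
    return math.exp(-0.5 * (ell * sigma_b) ** 2)


def triple_window(l1, l2, cos_phi, fwhm_rad):
    """Product W(l1) W(l2) W(|l1 + l2|) for an angle phi between l1 and l2."""
    l3_sq = max(0.0, l1 * l1 + l2 * l2 + 2.0 * l1 * l2 * cos_phi)
    l3 = math.sqrt(l3_sq)
    return (gaussian_window(l1, fwhm_rad)
            * gaussian_window(l2, fwhm_rad)
            * gaussian_window(l3, fwhm_rad))


if __name__ == "__main__":
    fwhm = 10.0 * ARCMIN
    print(gaussian_window(1000.0, fwhm))
    print(triple_window(500.0, 800.0, 0.3, fwhm))
```

The triple product equals $e^{-\sigma_b^2 (l_1^2 + l_2^2 + l_1 l_2 \cos\varphi)}$; the angular integral of this factor against $\cos\varphi$ is what produces the Bessel function $I_1(\sigma_b^2 l_1 l_2)$ and the Gaussian damping in the skewness integral.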

In the presence of instrumental noise, the observed one-point probability distribution function (PDF) will be a
convolution of the signal PDF characterized by the skewness and kurtosis given above, and a Gaussian noise PDF:
$P_{\rm obs}(\Theta^t) = \int P_{\rm sig}(\Theta^{\rm sig})\, P_{\rm noise}(\Theta^t - \Theta^{\rm sig})\, d\Theta^{\rm sig}$. In order to perform this convolution we must first determine the explicit form
of the signal PDF that will have nonzero skewness or kurtosis, but vanishing higher cumulants. To do this, we follow
the formalism discussed in Ref. [13] and references therein. The PDF of a random variable $\Theta$ with zero mean and
variance $\sigma^2$ can be expressed as a Gram-Charlier series in the normalized variable $\nu = \Theta/\sigma$:

$$
p(\nu) = c_0\, \phi(\nu) + \frac{c_1}{1!}\, \phi^{(1)}(\nu) + \frac{c_2}{2!}\, \phi^{(2)}(\nu) + \ldots ,
\tag{19}
$$

where $\phi(\nu) = (2\pi)^{-1/2} e^{-\nu^2/2}$ is a Gaussian distribution. The $\phi^{(l)}(\nu)$ are derivatives of the Gaussian distribution with
respect to $\nu$:

$$
\phi^{(l)}(\nu) = \frac{d^l \phi}{d\nu^l} = (-1)^l H_l(\nu)\, \phi(\nu) ,
\tag{20}
$$

and the $H_l(\nu)$ are Hermite polynomials with the unconventional normalization

$$
\int_{-\infty}^{\infty} H_l(\nu)\, H_m(\nu)\, \phi(\nu)\, d\nu = l!\, \delta_{lm} .
\tag{21}
$$

The central moments of the PDF are defined as

$$
\mu_l = \sigma^l \int p(\nu)\, \nu^l\, d\nu ,
\tag{22}
$$

while the cumulants or "connected" portions of these moments can be derived from the relation

$$
\kappa_l = \left. \frac{d^l \ln \langle e^{t\Theta} \rangle}{dt^l} \right|_{t=0} .
\tag{23}
$$



Using the expansion (19) and the orthogonality relation (21), the coefficients of the Gram-Charlier series can be
expressed in terms of the central moments. By inverting Eq. (23), the central moments can then be reexpressed in
terms of cumulants. As discussed in the previous Section, the assumption that the lensing potential is Gaussian implies
that all cumulants of higher order than the kurtosis must vanish. Using this result, we can rewrite the Gram-Charlier
expansion as a power series in the skewness $S$ or kurtosis $K$, which in the case of lensing will be small quantities:

$$
\begin{aligned}
p(\nu) &= \left[ 1 + \frac{S}{3!}\, H_3(\nu) + \frac{10\, S^2}{6!}\, H_6(\nu) + \ldots \right] \phi(\nu) , \\
p(\nu) &= \left[ 1 + \frac{K}{4!}\, H_4(\nu) + \frac{35\, K^2}{8!}\, H_8(\nu) + \ldots \right] \phi(\nu) .
\end{aligned}
\tag{24}
$$
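
As a sanity check on these truncated series, one can integrate them numerically and confirm that they are normalized and carry the intended skewness or kurtosis. The sketch below assumes the truncation shown above, $p(\nu) = [1 + (S/3!) H_3 + (10 S^2/6!) H_6]\,\phi$ and $p(\nu) = [1 + (K/4!) H_4 + (35 K^2/8!) H_8]\,\phi$, with $H_l$ the Hermite polynomials defined by $(-1)^l \phi^{(l)}/\phi$; all function names are illustrative.

```python
import math


def hermite(n, x):
    """Hermite polynomial H_n(x) defined by (-1)^n phi^(n)(x) / phi(x),
    built from the recurrence H_{k+1} = x H_k - k H_{k-1}."""
    h_prev, h = 1.0, x
    if n == 0:
        return h_prev
    for k in range(1, n):
        h_prev, h = h, x * h - k * h_prev
    return h


def gaussian(x):
    """Unit-variance Gaussian phi(x)."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def pdf_skewed(nu, S):
    """Series truncated at second order in the skewness S."""
    return (1.0 + S / 6.0 * hermite(3, nu)
            + 10.0 * S * S / 720.0 * hermite(6, nu)) * gaussian(nu)


def pdf_kurtotic(nu, K):
    """Series truncated at second order in the kurtosis K."""
    return (1.0 + K / 24.0 * hermite(4, nu)
            + 35.0 * K * K / 40320.0 * hermite(8, nu)) * gaussian(nu)


def moment(pdf, order, param, lim=12.0, steps=4000):
    """Trapezoid-rule estimate of the integral of nu^order * pdf(nu, param)."""
    h = 2.0 * lim / steps
    total = 0.0
    for i in range(steps + 1):
        nu = -lim + i * h
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * nu ** order * pdf(nu, param)
    return total * h


if __name__ == "__main__":
    print(moment(pdf_skewed, 3, 0.1))          # third moment of the skewed PDF
    print(moment(pdf_kurtotic, 4, 0.05) - 3.0)  # excess fourth moment
```

Because the Hermite polynomials are orthogonal under the Gaussian weight, the $H_6$ and $H_8$ terms do not alter the third and fourth moments, so the integrals return $S$ and $K$ to within the accuracy of the quadrature.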



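These power series can be convolved with a Gaussian noise PDF of variance $\sigma_{\rm noise}^2(\theta)$ to obtain the observed PDF
$P_{\rm obs}(\Theta^t)$. To linear order in the true skewness and kurtosis, we find:

$$
\begin{aligned}
S_{\rm obs}(\theta) &= S(\theta) \left[ \frac{\sigma^2(\theta)}{\sigma^2(\theta) + \sigma_{\rm noise}^2(\theta)} \right]^{3/2} , \\
K_{\rm obs}(\theta) &= K(\theta) \left[ \frac{\sigma^2(\theta)}{\sigma^2(\theta) + \sigma_{\rm noise}^2(\theta)} \right]^{2} .
\end{aligned}
\tag{25}
$$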
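
The dilution of the signal skewness and kurtosis by noise can be written as a two-line calculation. The sketch below assumes Gaussian noise that is independent of the signal, so that the third and fourth cumulants of the signal are unchanged by the convolution while the variance grows to $\sigma^2 + \sigma_{\rm noise}^2$; the function names are hypothetical.

```python
def observed_skewness(S, sigma_sig, sigma_noise):
    """Skewness after adding independent Gaussian noise.

    The third cumulant S * sigma_sig**3 is preserved; only the
    normalizing variance changes.
    """
    ratio = sigma_sig ** 2 / (sigma_sig ** 2 + sigma_noise ** 2)
    return S * ratio ** 1.5


def observed_kurtosis(K, sigma_sig, sigma_noise):
    """Kurtosis after adding independent Gaussian noise."""
    ratio = sigma_sig ** 2 / (sigma_sig ** 2 + sigma_noise ** 2)
    return K * ratio ** 2


if __name__ == "__main__":
    # Equal signal and noise variance: skewness drops by 2**-1.5, kurtosis by 1/4.
    print(observed_skewness(0.1, 1.0, 1.0))
    print(observed_kurtosis(0.2, 1.0, 1.0))
```

Setting `sigma_noise` to zero returns the input values, and letting it grow drives both quantities to zero.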
As expected, the observed skewness and kurtosis converge to the signal values in the absence of noise and to zero in
the case when the Gaussian noise is dominant. To actually observe skewness or kurtosis in an experimental sky map,
we must construct estimators for these quantities using our data points, the $N = 4\pi f_{\rm sky}/[\pi(\theta/2)^2]$ pixels in the map.
We can write estimators for the skewness and kurtosis as

$$
\begin{aligned}
\widehat{S\sigma^3} &= \frac{1}{N} \sum_{i=1}^{N} (x_i - \bar{x})^3 , \\
\widehat{K\sigma^4} &= \frac{1}{N} \sum_{i=1}^{N} (x_i - \bar{x})^4 - 3 \left[ \frac{1}{N} \sum_{i=1}^{N} (x_i - \bar{x})^2 \right]^2 ,
\end{aligned}
\tag{26}
$$

where $\bar{x} = \frac{1}{N}\sum_{i=1}^{N} x_i$ is the traditional estimator for the mean of a distribution. For a distribution like that of the
CMB anisotropies which is a priori defined to have a zero mean, we find:



(27) 



These are biased estimators, as has been noted elsewhere under a different context, but in the large-N limit they
converge to the desired quantities. Assuming that the underlying PDF is Gaussian, the variance of these estimators
to lowest order in $1/N$ is given by:

$$
\sigma_S^2 = \frac{3!}{N} , \qquad \sigma_K^2 = \frac{4!}{N} .
\tag{28}
$$



An alternate derivation of these variances can be obtained from the explicit form of the PDFs following Eq. (24).
If $N$ pixels or data points are collected and binned such that $p_i$ is the probability that a data point will fall within
bin $i$ and $\sigma_i$ is the standard deviation of that probability, then the best variance of a parameter $\epsilon$ characterizing the
PDF is given by the Cramér-Rao bound,

$$
\frac{1}{\sigma_\epsilon^2} = \sum_i \frac{1}{\sigma_i^2} \left( \frac{\partial p_i}{\partial \epsilon} \right)^2 .
\tag{29}
$$

If the error on each bin is assumed to be Poisson, then $\sigma_i^2 = p_i/N$. In the limit of a continuous PDF, $p_i \to p(\nu)\, d\nu$
and the discrete sum (29) becomes an integral:

$$
\frac{1}{\sigma_\epsilon^2} = N \int \frac{1}{p} \left( \frac{\partial p}{\partial \epsilon} \right)^2 d\nu .
\tag{30}
$$

Inserting Eq. (24) into Eq. (30) under the Gaussian null hypothesis $S = K = 0$, we find lowest attainable errors
of $\sigma_\epsilon^2 = 3!/N,\ 4!/N$ for $\epsilon = S,\ K$, in agreement with the explicit calculation of the variance of our estimators noted
in Eq. (28). Further discussion of the variance associated with different estimators for the skewness and kurtosis is
included in the Appendix.
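
The Cramér-Rao integral can be evaluated numerically as a cross-check. The sketch below assumes the null hypothesis $S = K = 0$, for which $p = \phi$ and $\partial p/\partial\epsilon = H_n(\nu)\,\phi(\nu)/n!$ with $n = 3$ for the skewness and $n = 4$ for the kurtosis; the integrand then reduces to $H_n^2 \phi/(n!)^2$. The helper names are ours.

```python
import math


def hermite(n, x):
    """Hermite polynomial from the recurrence H_{k+1} = x H_k - k H_{k-1}."""
    h_prev, h = 1.0, x
    if n == 0:
        return h_prev
    for k in range(1, n):
        h_prev, h = h, x * h - k * h_prev
    return h


def gaussian(x):
    """Unit-variance Gaussian phi(x)."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def fisher_integral(order, lim=12.0, steps=4000):
    """Trapezoid estimate of the integral of (1/p)(dp/d eps)^2 at S = K = 0."""
    h = 2.0 * lim / steps
    norm = math.factorial(order) ** 2
    total = 0.0
    for i in range(steps + 1):
        nu = -lim + i * h
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * hermite(order, nu) ** 2 * gaussian(nu) / norm
    return total * h


def min_variance(order, n_pix):
    """Lowest attainable variance on S (order=3) or K (order=4) from n_pix pixels."""
    return 1.0 / (n_pix * fisher_integral(order))


if __name__ == "__main__":
    n_pix = 1000
    print(n_pix * min_variance(3, n_pix))  # compare with 3! = 6
    print(n_pix * min_variance(4, n_pix))  # compare with 4! = 24
```

The orthogonality of the Hermite polynomials gives the integral the value $1/n!$, which is how the factors $3!$ and $4!$ arise.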



V. RESULTS & DISCUSSION 



A. Skewness 



We illustrate in Fig. 1 our results for skewness due to the correlation between lensing and the SZ effect. We
calculate this correlation following Ref. [?] using the halo approach to large-scale structure [?]. The skewness
approaches zero at small values of the smoothing scale, consistent with our conclusion that no non-Gaussian signatures
exist in the PDF in the limit of infinite resolution. As shown, skewness due to the lensing-SZ correlation peaks at
an angular scale of tens of arcminutes, which is in the range of interest to upcoming experiments such as MAP and
Planck. When calculating expected signal-to-noise ratios for these experiments, we use detector sensitivities and
resolutions tabulated in Ref. [?]. For simplicity, we combine information from individual frequency channels to form
one estimate of temperature with an overall noise given by inversely weighting individual noise contributions.

FIG. 1. Left: The skewness due to lensing-SZ correlations for a perfect (no-noise) experiment (solid line), Planck (dashed
line), and MAP (dotted line) for $\sigma_8 = 0.9$. The CBI $1\sigma$ upper bound of $\sigma_8 < 1.2$ leads to a higher value for the skewness
as indicated by the dot-dashed line. Right: The signal-to-noise ratio for the detection of skewness in CMB data with curves
labeled as in the left figure. We assume full sky coverage; for partial sky coverage the signal-to-noise ratio scales as $\sqrt{f_{\rm sky}}$,
where $f_{\rm sky}$ is the fraction of sky covered.

The skewness as shown has signal-to-noise ratios slightly less than unity, suggesting that its detection may be hard
and potentially affected by noise. However, recent small-scale excess-power detections by experiments such as CBI
raise the possibility that we may have underestimated the lensing-SZ correlation and thus the skewness. The
lensing-SZ power spectrum $C_l^{\phi s}$ is roughly proportional to the fifth power of $\sigma_8$, the standard deviation of linear mass
fluctuations within an $8\,h^{-1}$ Mpc sphere. If we adopt the CBI $1\sigma$ upper bound of $\sigma_8 < 1.2$ [?] as opposed to the
value $\sigma_8 = 0.9$ suggested by previous studies, our signal increases by a factor of 4.21. In this case, Planck could
conceivably detect skewness with a signal-to-noise of 2.5. The potential for detection of the temperature skewness is
consistent with previous expectations that the temperature anisotropy bispectrum due to lensing-SZ correlation can
be detected in future data [?]. The cumulative signal-to-noise for skewness, however, is significantly smaller than that
for the full bispectrum because the skewness is a single number while the bispectrum contains all information related
to non-Gaussianities at the three-point level. As described below, we find a similar reduction in signal-to-noise for
kurtosis when compared to the full trispectrum.
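
The quoted boost factor follows directly from the assumed fifth-power scaling. The short sketch below reproduces it and also applies the $\sqrt{f_{\rm sky}}$ scaling of the signal-to-noise ratio for partial sky coverage; the function names and the default exponent are illustrative assumptions, with the exponent taken from the approximate scaling stated in the text.

```python
import math


def skewness_boost(sigma8_new, sigma8_ref=0.9, power=5):
    """Factor by which the lensing-SZ signal changes when sigma_8 is rescaled,
    assuming the signal scales as sigma_8**power (power=5 per the text)."""
    return (sigma8_new / sigma8_ref) ** power


def scale_signal_to_noise(sn_full_sky, f_sky):
    """Signal-to-noise for partial sky coverage, scaling as sqrt(f_sky)."""
    return sn_full_sky * math.sqrt(f_sky)


if __name__ == "__main__":
    print(round(skewness_boost(1.2), 2))     # boost for sigma_8 = 1.2 vs 0.9
    print(scale_signal_to_noise(2.5, 0.25))  # S/N of 2.5 on a quarter of the sky
```

Since $(1.2/0.9)^5 = (4/3)^5 \approx 4.21$, a full-sky signal-to-noise slightly below unity for $\sigma_8 = 0.9$ rises to the level of a few for $\sigma_8 = 1.2$.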

The frequency dependence of the SZ effect allows us to construct an SZ map of the sky as well as a temperature
map with the SZ effect removed. This provides us a unique opportunity to test our understanding of non-Gaussianity
at the three-point level. If skewness is purely a consequence of lensing-SZ correlations as posited in this paper, then
the skewness obtained by combining one measurement of the SZ map with two measurements of the SZ-cleaned
temperature map at the same location using the estimator in Eq. (26) should be precisely one third that produced by
three measurements of the total anisotropy map. This corresponds to the fact that the composite map will sample
only one of the three permutations appearing in Eq. (14).

FIG. 2. Left: The kurtosis $K^{\phi\phi}$ due to lensing autocorrelations and $K^{\phi s}$ due to lensing-SZ cross-correlations for a perfect
(no-noise) experiment (solid line) and Planck (dashed line). The kurtosis due to lensing-SZ correlations is negative at smoothing
scales below the kink at ~8 arcminutes and positive thereafter; its absolute value is shown here. Right: The signal-to-noise
ratio for the detection of kurtosis in CMB data with curves labeled as in the left figure. We assume full sky coverage; for partial
sky coverage the signal-to-noise ratio scales as $\sqrt{f_{\rm sky}}$, where $f_{\rm sky}$ is the fraction of sky covered.



B. Kurtosis 



Both the lensing kurtosis $K^{\phi\phi}$ and the kurtosis $K^{\phi s}$ due to lensing-SZ correlations are undetectable even for a perfect
no-noise experiment as illustrated in Fig. 2. Since the cumulative signal-to-noise ratio for $K^{\phi s}$ is well below one,
we expect it to remain undetectable despite any uncertainty in our calculation of the SZ effect. Note that our prediction
of the lensing kurtosis $K^{\phi\phi}$ is likely to be more certain since it only depends on the matter power spectrum, with
contributions coming mainly from the linear regime. Thus, uncertainties in non-linear aspects of clustering are unlikely
to affect our conclusion.

The signal-to-noise value for $K^{\phi\phi}$ can be compared to the cumulative signal-to-noise ratio for the direct detection
of the full trispectrum due to lensing, which in the case of Planck can be as high as ~55 [?]. Consequently, although
the lensing kurtosis cannot be detected directly from the data, lensing effects associated with this kurtosis can
be used to reconstruct the lensing deflection angle as described in Refs. [?], again with cumulative signal-to-noise
ratios significantly greater than that for the kurtosis itself. The higher signal-to-noise ratio in lensing reconstruction is
possible for two reasons. First, unlike the kurtosis, which averages indiscriminately over all configurations of the trispectrum
as shown in Eq. (17), lensing reconstruction is sensitive to certain configurations of the trispectrum, mainly those
that contribute to the power spectrum of squared temperature. This avoids severe positive-negative cancellations
that significantly reduce the signature of non-Gaussianity. Second, the noise contribution associated with lensing
reconstruction is also a priori reduced through a filter which is designed to extract information on the lensing potentials
optimally.

The low signal-to-noise associated with the kurtosis is also consistent with the fact that real-space moments, in
general, suffer from excess noise. Though such statistics are easily measurable in data, they do not provide
optimal methods to search for the existence of non-Gaussian signatures. While we recommend construction of
cumulants such as skewness and kurtosis as a first step in understanding non-Gaussianity from effects such as lensing,
we suggest that full measures of quantities such as the bispectrum and trispectrum will be necessary to fully understand
the non-Gaussian behavior of lensing. If measurement of such statistics is still cumbersome, we suggest the use
of quadratic statistics in real space, such as the squared-temperature-temperature and the squared-temperature-squared-temperature
[?] power spectra, which probe certain configurations of the bispectrum and trispectrum.

APPENDIX A: VARIANCE OF SKEWNESS AND KURTOSIS ESTIMATORS



A question arose during the composition of this paper as to the appropriate variance for estimates of the skewness
and kurtosis of a Gaussian distribution. The true skewness and kurtosis of a Gaussian distribution are necessarily
zero, but given $N$ data points $x_i$ drawn from this distribution even unbiased estimators will yield results distributed
about zero with some variance. Some sources (e.g. [?]) indicate variances of $15/N$ and $96/N$ respectively for the
skewness and kurtosis estimators defined in Eq. (26) as opposed to our values of $6/N$ and $24/N$. This discrepancy
prompted us to investigate further. The estimators of Eq. (26) differ from those given in Ref. [?] in that they are
estimators for the third and fourth cumulants rather than the dimensionless skewness and kurtosis to which they are
proportional. Assuming an underlying Gaussian distribution with a variance of unity, standard propagation of errors
reveals that the two pairs of estimators have the same variances to lowest order in $1/N$. However, the naive estimators

$$
\widehat{S\sigma^3}_{\rm naive} = \frac{1}{N} \sum_{i=1}^{N} x_i^3
\tag{A1}
$$

and the analogous kurtosis estimator $\widehat{K\sigma^4}_{\rm naive}$ formed from the uncentered data $x_i$
do indeed have variances of $15/N$ and $96/N$ for skewness and kurtosis respectively. We show this explicitly for the
naive skewness estimator $\widehat{S\sigma^3}_{\rm naive}$. The ensemble average of this estimator is simply $S\sigma^3$, so it is truly an unbiased
estimator for the skewness. However, taking the ensemble average of $(\widehat{S\sigma^3}_{\rm naive})^2$ we find

$$
\left\langle \left( \widehat{S\sigma^3}_{\rm naive} \right)^2 \right\rangle = \frac{1}{N} \left[ \mu_6 + (N-1)\, S^2 \sigma^6 \right] ,
\tag{A2}
$$

leading to a variance

$$
\left\langle \left( \widehat{S\sigma^3}_{\rm naive} \right)^2 \right\rangle - \left\langle \widehat{S\sigma^3}_{\rm naive} \right\rangle^2
 = \frac{1}{N} \left( \mu_6 - S^2 \sigma^6 \right) .
\tag{A3}
$$

For a Gaussian distribution, $\mu_6 = 15\sigma^6$ and $S = 0$, implying that this estimator measures skewness with a variance
of $15/N$ and is therefore less sensitive than $\widehat{S\sigma^3}$ defined in Eq. (26), which was shown to have a variance of $6/N$. An
entirely analogous calculation shows that the naive kurtosis estimator in Eq. (A1) has a variance of $96/N$, not $24/N$.

Why do the estimators of Eq. (26) outperform those of Eq. (A1)? Although the true mean of the underlying
Gaussian distribution has been chosen to be zero, the estimated mean $\bar{x} = \frac{1}{N}\sum_{i=1}^{N} x_i$ of $N$ data points will not
necessarily vanish. The more sophisticated estimators of Eq. (26) take this into account by subtracting the estimated
mean from each data point, and are therefore able to provide lower-variance estimates of the skewness and kurtosis.
These lower values for the variances are adopted for all results concerning signal-to-noise mentioned in this paper.
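
The difference between the two skewness estimators is easy to see in a small Monte Carlo experiment. The sketch below draws unit-variance Gaussian samples and compares the scatter of the mean-subtracted third-cumulant estimator with that of the naive estimator $\frac{1}{N}\sum x_i^3$. The sample size, number of trials, and seed are arbitrary illustrative choices, and the function names are ours; only the skewness case is simulated.

```python
import random


def third_cumulant_estimators(sample):
    """Return (mean-subtracted, naive) estimates of the third cumulant."""
    n = len(sample)
    mean = sum(sample) / n
    centered = sum((x - mean) ** 3 for x in sample) / n  # subtracts estimated mean
    naive = sum(x ** 3 for x in sample) / n              # assumes the mean is zero
    return centered, naive


def _variance(values):
    m = sum(values) / len(values)
    return sum((v - m) ** 2 for v in values) / len(values)


def scaled_variances(n_points=100, n_trials=3000, seed=1):
    """Monte Carlo estimate of N * Var for both estimators on Gaussian data."""
    rng = random.Random(seed)
    centered_vals, naive_vals = [], []
    for _ in range(n_trials):
        sample = [rng.gauss(0.0, 1.0) for _ in range(n_points)]
        c, nv = third_cumulant_estimators(sample)
        centered_vals.append(c)
        naive_vals.append(nv)
    return n_points * _variance(centered_vals), n_points * _variance(naive_vals)


if __name__ == "__main__":
    centered, naive = scaled_variances()
    print("N * Var (mean-subtracted):", centered)  # expected near 6
    print("N * Var (naive):          ", naive)     # expected near 15
```

With these settings the two scaled variances should come out close to 6 and 15 respectively, up to Monte Carlo scatter of a few percent and corrections of order $1/N$.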